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1 Shape-based ret rieval an d anal ysis o f 3D mode ls |j§ 
Thomas Funkhouser, Michael Kazhdan 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes 
SIGGRAPH '04 

Publisher: ACM Press 

Full text available: *jl ]pdfn2.56 MB) Additional Information: full citation, abstract 

Large repositories of 3D data are rapidly becoming available in several fields, including 
mechanical CAD, molecular biology, and computer graphics. As the number of 3D models 
grows, there is an increasing need for computer algorithms to help people find the 
interesting ones and discover relationships between them. Unfortunately, traditional text- 
based search techniques are not always effective for 3D models, especially when queries 
are geometric in nature (e.g., find me objects that fit into thi ... 

2 imageRMrjeM^ Issues. Te chniq ues, a nd S ystems H 
£^ M. L. Kherfi, D. Ziou, A. Bernardi 

▼ March 2004 ACM Computing Surveys (CSUR), Volume 36 issue 1 
Publisher: ACM Press 

Full text available: ^ssttSStSJSJSSl Additional Information: full citation, afesfcact, references, index terms 

With the explosive growth of the World Wide Web, the public is gaining access to massive 
amounts of information. However, locating needed and relevant information remains a 
difficult task, whether the information is textual or visual. Text search engines have 
existed for some years now and have achieved a certain degree of success. However, 
despite the large number of images available on the Web, image search engines are still 
rare. In this article, we show that in order to allow people to profi ... 

Keywords: Image-retrieval, World Wide Web, crawling, feature extraction and selection, 
indexing, relevance feedback, search, similarity 
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The problem of searching the elements of a set that are close to a given query element 
under some similarity criterion has a vast number of applications in many branches of 
computer science, from pattern recognition to textual and multimedia information 
retrieval. We are interested in the rather general case where the similarity criterion 
defines a metric space, instead of the more restricted case of a vector space. Many 
solutions have been proposed in different areas, in many cases without cros ... 

Keywords: Curse of dimensionality, nearest neighbors, similarity searching, vector 
spaces 



4 



Experiences with selecting search engines using metasearch 
Daniel Dreilinger, Adele E. Howe 

July 1997 ACM Transactions on Information Systems (TOIS), Volume is issue 3 
Publisher: ACM Press 

Full text available* flpdfC4?3 65 KBj AdditionaI Information: full citation, abstract, references, citings, index 

Search engines are among the most useful and high-profile resources on the Internet. 
The problem of finding information on the Internet has been replaced with the problem of 
knowing where search engines are, what they are designed to retrieve, and how to use 
them. This article describes and evaluates SavvySearch, a metasearch engine designed to 
intelligently select and interface with multiple remote search engines. The primary 
metasearch issue examined is the importance of carefully selecti ... 

Keywords: WWW, information retrieval, machine learning, search engine 
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w Publisher: ACM Press 
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The success of text-based retrieval motivates us to investigate analogous techniques 
which can support the querying and browsing of image data. However, images differ 
significantly from text both syntactically and semantically in their mode of representing 
and expressing information. Thus, the generalization of information retrieval from the text 
domain to the image domain is non-trivial. This paper presents a framework for 
information retrieval in the image domain which supports content-based q ... 

Keywords: clustering, codebook, content-based image retrieval, keyblock 
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Finding the longest matching prefix from a database of keywords is an old problem with a 
number of applications, ranging from dictionary searches to advanced memory 
management to computational geometry. But perhaps today's most frequent best 
matching prefix lookups occur in the Internet, when forwarding packets from router to 
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router. Internet traffic volume and link speeds are rapidly increasing; at the same time, a 
growing user population is increasing the size of routing tables against which p ... 

Keywords: collision resolution, forwarding lookups, high-speed networking 
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V November 2003 ACM Transactions on Internet Technology (TOIT), volume 3 issue 4 

Publisher: ACM Press 
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Accessing and processing distributed data sources have become important factors for 
businesses today. This is especially true for the emerging virtual enterprises with their 
data and processing capabilities spread across the Internet. Unfortunately, however, 
query processing on the Internet is not predictable and robust enough to meet the 
requirements of many business applications. For instance, the response time of a query 
can be unexpectedly high; or the monetary cost might be too high if the ... 

Keywords: Quality of Service 
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Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 
Studies on Collaborative research 

Publisher: IBM Press 

Full text available: ||)MK4v2±MB}. Additional Information: ftiJlciiatioa, abstract, references, [ndexjerms 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
and do not provide the user with the desired overview of the application. In our 
experience, such tools display repeated occurrences of non-trivial commun ... 

An adaptive information retrieval system based on associative networks 
Helmut Berger, Michael Dittenbach, Dieter Merkl 

January 2004 Proceedings of the first Asian-Pacific conference on Conceptual 
modelling - Volume 31 APCCM '04 

Publisher: Australian Computer Society, Inc. 

Full text available: ^.dMM§.33..\S3.1 Additional Information: MLvMtton, abstract references 

In this paper we present a multilingual information retrieval system that provides access 
to Tourism information by exploiting the intuitiveness of natural language. In particular, 
we describe the knowledge representation model underlying the information retrieval 
system. This knowledge representation approach is based on associative networks and 
allows the definition of semantic relationships between domain-intrinsic information items. 
The network structure is used to define weighted associatio ... 

Keywords: associative networks, constrained spreading activation, knowledge 
representation, natural language information retrieval 
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September 2005 ACM Transactions on Database Systems (TODS), volume 30 issue 3 
Publisher: ACM Press 

Full text available: ||| pdf{1 A2 MB) Additional Information: full citation, abstract references, index terms 

Query optimizers in object-relational database management systems typically require 
users to provide the execution cost models of user-defined functions (UDFs). Despite this 
need, however, there has been little work done to provide such a model. The existing 
approaches are static in that they require users to train the model a priori with 
pregenerated UDF execution cost data. Static approaches can not adapt to changing UDF 
execution patterns and thus degrade in accuracy when the UDF executions u ... 

Keywords: /C-nearest neighbors, cost modeling, object relational DBMS, quadtree, query 
optimization, self-tuning 
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Full text available: ||) pdf(29Q.21 KB) Additional Information: full citation, abstract, references. Index terms 

We present a complete and efficient framework for video shot indexing and retrieval. 
Video shots are described by their key-frame, themselves described by their regions. 
Region-based approaches suffer from the complexity of segmentation and comparison 
tasks. A compact region-based shot representation is usually obtained thanks to vector- 
quantization method. We thus introduce LSA to reduce the noise inherent to the 
segmentation and the quantization processes. Then to better capture the content ... 

Keywords: latent semantic analysis, region clustering, region similarity, region-based 
video retrieval, video analysis 




12 Feature-based similarity search in 3D object databases 

|k Benjamin Bustos, Daniel A. Keim, Dietmar Saupe, Tobias Schreck, Dejan V. Vranic 

December 2005 ACM Computing Surveys (CSUR), volume 37 issue 4 

Publisher: ACM Press 

Full text available: | |] pdf(5.29 MB) Additional Information: full citation, abstract, references. Index terms 

The development of effective content-based multimedia search systems is an important 
research issue due to the growing amount of digital audio-visual information. In the case 
of images and video, the growth of digital data has been observed since the introduction 
of 2D capture devices. A similar development is expected for 3D data as acquisition and 
dissemination technology of 3D models is constantly improving. 3D objects are becoming 
an important type of multimedia data with many promising appl ... 

Keywords: 3D model retrieval, content-based similarity search 
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Level set methods, an important class of partial differential equation (PDE) methods, 
define dynamic surfaces implicitly as the level set (iso-surface) of a sampled, evolving nD 
function. The course begins with preparatory material that introduces the concept of using 
partial differential equations to solve problems in computer graphics, geometric modeling 
and computer vision. This will include the structure and behavior of several different types 
of differential equations, e.g. the level set eq ... 

14 Subspace clustering for high dimensional data: a review 
& Lance Parsons, Ehtesham Haque, Huan Liu 

V* June 2004 ACM SIGKDD Explorations Newsletter, Volume 6 issue l 
Publisher: ACM Press 

Full text available: ^ odf(539. 13 K3) Additional Information: full citation, abstract, references 

Subspace clustering is an extension of traditional clustering that seeks to find clusters in 
different subspaces within a dataset. Often in high dimensional data, many dimensions 
are irrelevant and can mask existing clusters in noisy data. Feature selection removes 
Irrelevant and redundant dimensions by analyzing the entire dataset. Subspace clustering 
algorithms localize the search for relevant dimensions allowing them to find clusters that 
exist in multiple, possibly overlapping subspaces. The ... 

Keywords: clustering survey, high dimensional data, projected clustering, subspace 
clustering 



igiiigii 



15 Technique for automatically correcting words in text 
Jj^ Karen Kukich 

>^ December 1992 ACM Computing Surveys (CSUR), volume 24 issue 4 
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Research aimed at correcting words in text has focused on three progressively more 
difficult problems:(l) nonword error detection; (2) isolated-word error correction; and (3) 
context-dependent work correction. In response to the first problem, efficient pattern- 
matching and n-gram analysis techniques have been developed for detecting strings that 
do not appear in a given word list. In response to the second problem, a variety of 
general and application-specific spelling cor ... 

Keywords: n-gram analysis, Optical Character Recognition (OCR), context-dependent 
spelling correction, grammar checking, natural-language-processing models, neural net 
classifiers, spell checking, spelling error detection, spelling error patterns, statistical- 
language models, word recognition and correction 
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In this paper we review studies of the growth of the Internet and technologies that are 
useful for information search and retrieval on the Web. We present data on the Internet 
from several different sources, e.g., current as well as projected number of users, hosts, 
and Web sites. Although numerical figures vary, overall trends cited by the sources are 
consistent and point to exponential growth in the past and in the coming decade. Hence it 
is not surprising that about 85% of Internet user ... 
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Keywords: Internet, World Wide Web, clustering, indexing, information retrieval, 
knowledge management, search engine 
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Data sets in large applications are often too massive to fit completely inside the 
computers internal memory. The resulting input/output communication (or I/O) between 
fast internal memory and slower external memory (such as disks) can be a major 
performance bottleneck. In this article we survey the state of the art in the design and 
analysis of external memory (or EM) algorithms and data structures, where the goal is to 
exploit locality in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 



19 Student best paper contest: Confidence-based dynamic ensemble for image 

annMatjon.and semaQt|cs.discoyery. 
Beitao Li, Kingshy Goh 
November 2003 Proceedings of the eleventh ACM international conference on 

Multimedia 
Publisher: ACM Press 

Full text available- odf(27fi 37 K3 ) Additional Information: Ml .citation, abstract, reM^apes, citings, index 
' 1 — " terms 

Providing accurate and scalable solutions to map low-level perceptual features to high- 
level semantics is critical for multimedia information organization and retrieval. In this 
paper, we propose a confidence-based dynamic ensemble (CDE) to overcome the 
shortcomings of the traditional static classifiers. In contrast to the traditional models, CDE 
can make dynamic adjustments to accommodate new semantics, to assist the discovery 
of useful low-level features, and to improve class-prediction ... 
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V June 2002 Proceedings of the twenty-first ACM SIGMOD-SXGACT-SIGART symposium 
on Principles of database systems 
Publisher: ACM Press 
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In this overview paper we motivate the need for and research issues arising from a new 
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model of data processing. In this model, data does not take the form of persistent 
relations, but rather arrives in multiple, continuous, rapid, time-varying data streams. In 
addition to reviewing past work relevant to data stream systems and current projects in 
the area, the paper explores topics in stream query languages, new requirements and 
challenges in query processing, and algorithmic issues. 
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